3574 results found.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY 4.0
Size:
60000 hours Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Improving Unsupervised Sparsespeech Acoustic Models with Categorical Reparameterization
-
Paper track:10.8 Zero-resource speech recognition/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Benjamin Milde | Libri-Light | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
FDP data transfer form
Size:
100 hours Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
-
Paper title:An Efficient Temporal Modeling Approach for Speech Emotion Recognition by Mapping Varied Duration Sentences into Fixed Number of Chunks
-
Paper track:3.1 Analysis of speaker states/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Carlos Busso | The MSP-Podcast corpus | /N |
Documentation:
English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
4.4 GByte Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Risk Forecasting from Earnings Calls Acoustics and Network Correlations
-
Paper track:10.9 Other topics in Speech Recognition -Technolog/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ramit Sawhney | EarningsCall_Dataset | /N |
Documentation:
Documentation is available in English: Qin, Yu., & Yang, Yi. (2019). What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
MIT License
Size:
200 minutes Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
-
Paper title:Detecting and Counting Overlapping Speakers in Distant Speech Scenarios
-
Paper track:5.4 Speech and audio segmentation/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Samuele Cornell | OSDC | /N |
Documentation:
Available at https://github.com/popcornell/OSDC in English
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
FDP data transfer form
Size:
100 hours Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
-
Paper title:Ensemble of Students Taught by Probabilistic Teachers to Improve Speech Emotion Recognition
-
Paper track:3.3 Automatic analysis of speaker states/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Carlos Busso | MSP-Podcast corpus | /N |
Documentation:
English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
13 million sentences Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Improved hybrid streaming ASR with Transformer language models
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Pau Baquero-Arnal | TED-LIUM | /N |
Documentation:
There is publicly available documentation in English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
982.1 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Improved hybrid streaming ASR with Transformer language models
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Pau Baquero-Arnal | LibriSpeech | /N |
Documentation:
There is publicly available documentation in English
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CDLA-Permissive
Size:
5 hours Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
-
Paper title:DiPCo - Dinner Party Corpus
-
Paper track:8.3 Robustness against noise or reverberation/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Roland Maas | DiPCo - Dinner Party Corpus | /N |
Documentation:
https://arxiv.org/pdf/1909.13447.pdf
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Multilingual Jointly Trained Acoustic and Written Word Embeddings
-
Paper track:9.9 Cross-lingual and multilingual components for /Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shane Settle | The Switchboard-1 Telephone Speech Corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
15 dysarthric speakers, 13 control speakers OtherProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Domain Adversarial Training for Dysarthric Speech Recognition
-
Paper track:10.9 Other topics in Speech Recognition -Technolog/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dominika Woszczyk | Universal Access Speech database | /N |
Documentation:
http://www.isle.illinois.edu/sst/data/UASpeech/




